Combining Active and Ensemble Learning for Efficient Classification of Web Documents

Authors: Steffen Schnitzer, Sebastian Schmidt, Christoph Rensing, Bettina Harriehausen-Mühlbauer

Polibits, Vol. 49, pp. 39-45, 2014.

Abstract: Classification of text remains a challenge. Most machine learning based approaches require many manually annotated training instances for a reasonable accuracy. In this article we present an approach that minimizes the human annotation effort by interactively incorporating human annotators into the training process via active learning of an ensemble learner. By passing only ambiguous instances to the human annotators the effort is reduced while maintaining a very good accuracy. Since the feedback is only used to train an additional classifier and not for re-training the whole ensemble, the computational complexity is kept relatively low.

Keywords: Text classification, active learning, user feedback, ensemble learning

PDF: Combining Active and Ensemble Learning for Efficient Classification of Web Documents
PDF: Combining Active and Ensemble Learning for Efficient Classification of Web Documents